Automatic Evaluation of Dysarthric Speech and Telemedical Use in the Therapy

نویسندگان

Elmar Nöth

Andreas Maier

Arnd Gebhard

Tobias Bocklet

Wilfried Schupp

Maria Schuster

Tino Haderlein

چکیده

After a stroke, the speech quality of the patients is often reduced. This is usually caused by a deficit of the motor abilities of the vocal tract. The result is slurred speech. In the various patients, however, very different forms can appear. In the course of therapy, evaluation of the speech quality is required to determine the success of the treatment. At the moment, this assessment is performed only perceptually. This form of assessment is subject to strong intraand inter-individual variation. Therefore, an “objective” assessment is not guaranteed. In this study, we present a rater-independent method for evaluating speech disorders in dysarthria. We use methods of automatic speech recognition. The idea is to determine the speech intelligibility – the main outcome parameter of speech – automatically by an automatic speech recognizer. A correlation of -0.89 was obtained between the criterion “intelligibility” and the recognition rate of the automatic system, in a preliminary study. The second part of this paper deals with an additional problem with this kind of patient. Very often, the stroke leads to partial facial paresis and generally to reduced mobility. Therefore, it is desirable that therapy sessions are performed in a telemedical setup. We report on our work towards such a telemedical diagnosis and rehabilitation system which will allow sessions with a therapist and – at the same time – diagnose the patient and track the recovery process. We describe the equipment (web camera, 3-D camera, stereo microphone, and Internet connection), the patient's environment, and the working environment of the therapist. Depending on the network connection, live images of the patient can be sent to the therapist at a rate of 20 frames per second (fps) at existing LAN connections or 3 fps with a DSL 6000 connection. At the patient host, a three-dimensional face model of the patient performing a facial exercise can be generated and transferred to the therapist in real-time (LAN) or three times real-time (DSL 6000).

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Improvement of Continuous Dysarthric Speech Quality

Dysarthria refers to a group of motor speech disorders as the result of any neurological injury to the speech production system. Dysarthric speech is characterised by poor speech articulation, resulting in degradation in speech quality. Hence, it is important to correct or improve dysarthric speech so as to enable people having dysarthria to communicate better. The aim of this paper is to impro...

متن کامل

An ASR-Based Interactive Game for Speech Therapy

The demand for intensive and costly speech therapy to patients impaired by communicative disorders can potentially be alleviated by developing computer-based systems that provide automatized speech therapy in the patient’s home environment. In this paper we report on research aimed at developing such a system that combines serious gaming with automatic speech recognition (ASR) technology to pro...

متن کامل

A Database for Automatic Persian Speech Emotion Recognition: Collection, Processing and Evaluation

Abstract Recent developments in robotics automation have motivated researchers to improve the efficiency of interactive systems by making a natural man-machine interaction. Since speech is the most popular method of communication, recognizing human emotions from speech signal becomes a challenging research topic known as Speech Emotion Recognition (SER). In this study, we propose a Persian em...

متن کامل

Automatic Prediction of Speech Evaluation Metrics for Dysarthric Speech

During the last decades, automatic speech processing systems witnessed an important progress and achieved remarkable reliability. As a result, such technologies have been exploited in new areas and applications including medical practice. In disordered speech evaluation context, perceptual evaluation is still the most common method used in clinical practice for the diagnosing and the following ...

متن کامل

Automatic Anomaly Detection for Dysarthria across Two Speech Styles: Read vs Spontaneous Speech

Perceptive evaluation of speech disorders is still the standard method in clinical practice for the diagnosing and the following of the condition progression of patients. Such methods include different tasks such as read speech, spontaneous speech, isolated words, sustained vowels, etc. In this context, automatic speech processing tools have proven pertinence in speech quality evaluation and as...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2012

Automatic Evaluation of Dysarthric Speech and Telemedical Use in the Therapy

نویسندگان

چکیده

منابع مشابه

Improvement of Continuous Dysarthric Speech Quality

An ASR-Based Interactive Game for Speech Therapy

A Database for Automatic Persian Speech Emotion Recognition: Collection, Processing and Evaluation

Automatic Prediction of Speech Evaluation Metrics for Dysarthric Speech

Automatic Anomaly Detection for Dysarthria across Two Speech Styles: Read vs Spontaneous Speech

عنوان ژورنال:

اشتراک گذاری